Creating Ontologies from Web documents
نویسنده
چکیده
In this paper we present a methodology to build automatically an ontology, extracting information from the World Wide Web from an initial keyword. This ontology represents a taxonomy of classes and gives to the user a general view of the kind of concepts and the most significant sites that he can find on the Web for the specified keyword's domain. The system uses intensively a publicly available search engine, extracts concepts (based on its relation to the initial one and statistical data about appearance) and represents the result in a standard way.
منابع مشابه
OntoMiner: Bootstrapping and Populating Ontologies from Domain Specific Web Sites
HTML documents, which are designed primarily for human consumption. The presence of such legacy documents makes embracing the Semantic Web vision difficult.2 Thus, we need scalable solutions to automatically transform legacy HTML to Semantic Web documents. Recent work describes algorithms that automatically annotate HTML documents with semantic labels.3 Unfortunately, constructing the domain on...
متن کاملτOWL-Manager: A Tool for Managing Temporal Semantic Web Documents in the τOWL Framework
Several semantic web-based applications (e.g., ecommerce, e-government and e-health applications) require temporal versioning of ontology instances, in order to represent, store and retrieve time-varying ontologies. However, commercial systems do not provide any support for creating and updating temporal ontologies. In this paper, we propose a prototype system, named Temporal OWL 2 Web Ontology...
متن کاملSemantic Web: A state of the art survey
The semantic web is an extension of the current web in which information is given well-defined meaning. It is a concept that enables better machine processing of information on the web, by structuring documents written for the web in such a way that they become understandable by computers. This can be used for creating complex applications such as intelligent browsers, intelligent software agen...
متن کاملClassification of Web Documents Using Concept Extraction from Ontologies
In this paper, we deal with the problem of analyzing and classifying web documents in a given domain by information filtering agents. We present the ontology-based web content mining methodology that contains such main stages as creation of ontology for the specified domain, collecting a training set of labeled documents, building a classification model in this domain using the constructed onto...
متن کاملUse of Linked Data principles for semantic management of scanned documents Emprego dos princípios Linked Data para gestão semântica de documentos digitalizados
The study addresses the use of the Semantic Web and Linked Data principles proposed by the World Wide Web Consortium for the development of Web application for semantic management of scanned documents. The main goal is to record scanned documents describing them in a way the machine is able to understand and process them, filtering content and assisting us in searching for such documents when a...
متن کاملτOWL: A Framework for Managing Temporal Semantic Web Documents
The World Wide Web Consortium (W3C) OWL 2 Web Ontology Language (OWL 2) recommendation is an ontology language for the Semantic Web. It allows defining both schema (i.e., entities, axioms, and expressions) and instances (i.e., individuals) of ontologies. OWL 2 ontologies are stored as Semantic Web documents. However, OWL 2 lacks explicit support for time-varying schema or for time-varying insta...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004